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Abstract. We introduce two-sorted theories in the style of Cook and Nguyen for the 
complexity classes ©L and DET, whose complete problems include determinants over 
Z2 and Z, respectively. We then describe interpretations of Soltys' linear algebra theory 
LAp over arbitrary integral domains, into each of our new theories. The result shows 
equivalences of standard theorems of linear algebra over Z2 and Z can be proved in the 
corresponding theory, but leaves open the interesting question of whether the theorems 
themselves can be proved. 



1. Introduction 

This paper is a contribution to bounded reverse mathematics |Ngu08[ ICN10| . that part 
of proof complexity concerned with determining the computational complexity of concepts 
needed to prove theorems of interest in computer science. We are specifically interested in 
theorems of linear algebra over finite fields and the integers. The relevant complexity classes 
for each case have been well-studied in the computational complexity literature. The classes 
are @L and DET, associated with linear algebra over Z2 and Z, respectively. We introduce 
formal theories V®L and V#L for ©L and DET, each intended to capture reasoning in 
the corresponding class. Each theory allows induction over any relation in the associated 
complexity class, and the functions definable in each theory are exactly the functions in 
the class. In particular determinants and coefficients of the characteristic polynomial of a 
matrix can be defined. 

To study the question of which results from linear algebra can be proved in the theories 
we take advantage of Soltys's theory LAp [SK01, SC04J for formalizing linear algebra over an 
arbitrary field or integral domain. We present two interpretations of LAP: one into V®L 
and one into Vj^L. Both interpretations translate theorems of LAp to theorems in the 
corresponding theory, but the meaning of the theorems differs in the two translations since 
the ring elements range over Z2 in one and over Z in the other. From these interpretations 
and results in [SKOll ISCQ4] we show that the theories prove some interesting properties of 
determinants, but leave open the question of whether the proofs of some basic theorems 
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such as the Caley-Hamilton Theorem can be formalized in the theories. We also leave open 
the question of whether the theories prove simple matrix identities studied in [SK01, S C04] . 
such as AB = I — > BA = I. An affirmative answer would shed light on interesting questions 
in propositional proof complexity concerning the lengths of proofs required in various proof 
systems to prove tautology families corresponding to the identities. 

1.1. The complexity classes. Complete problems for the classes ©L and DET include 
standard computational problems of linear algebra over their respective rings Z2 and Z, such 
as computing determinants, matrix powers, and coefficients of the characteristic polynomial 
of a matrix. (Recently [BKR0 9] proved that for each k > 1, computing the permanent mod 
2 k of an integer matrix is in ®L, and hence complete.) The classes satisfy the inclusions 



(ignoring the distinction between function and language classes) where L and NL are the 
problems accepted in deterministic and nondeterministic log space, respectively. It is not 
known whether ©L and NL are comparable. (In fact no one has been able to disprove the 
unlikely possibility that all of the above classes except AC coincide.) 

The simplest way of defining the classes ®L and DET is using uniform AC reductions: 
Let AC°{f) be the set of functions computable by a uniform family of polynomial size 
constant depth circuits with oracle access to /. Then ®L = AC°{det2) and DET = 
AC {det), where det 2 and det are the determinant functions for matrices over Z2 and Z 
respectively. For the case of Z, integer entries of a matrix are presented in binary. 

The usual definitions of these classes involve counting the number of accepting compu- 
tations of nondeterministic log space Turing machines. Thus j^L is the class of functions 
/ such that for some nondeterministic log space Turing machine M, f{x) is the number 
of accepting computations of M on input x. Then the sets in ®L are those of the form 
{x I f(x) mod 2 = 1} for some / in #L. It turns out that AC {det) = AC°{#L), and 
AC°{det 2 ) = AC°{®L) = ©L |AU96l [BDHM92] . 

DET can also be characterized as the j^L hierarchy j^LB. This is defined as follows: 
#LB 1 = #L, and for i > 1, #LB i+1 = #L* LH \ (The exponent #LBi indicates that a 
function from this class is allowed to be an oracle for the log space machine whose accepting 
computations are being counted). Then A()!)(i shows 



We should clarify that our definition of DET = AC {det) here differs from that given 
in [Coo85| . where DET is defined to be NC l {det), the closure of {det} under the more 
general NC 1 reductions. Allender proved (see the Appendix to AllO f] ) that if AC {det) = 
NC 1 {det) then the j^L hierarchy collapses to some finite level #LBi, something that is 
not known to be true. However the present first author wrote |Coo85j before uniform AC 
reductions had been studied, and now believes that AC reductions are the natural ones 
to use in studying small complexity classes. Evidence for this is that (|1.2p holds when 
DET = AC {det) as we now define DET, and does not hold under the old definition 
(assuming the #L hierarchy is strict). 

The above inclusions (jl.ip compare the class of functions DET with classes of relations. 
Here and elsewhere we sometimes do not explicitly distinguish between a function class FC 



AC C L C ©L C DET C NC 2 C P 
AC C L C NL C DET C NC 2 



(1.1) 




(1.2) 
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and the corresponding relation class C, using the following standard correspondence (which 
applies to all classes we consider): A relation is in C iff its characteristic function is in FC, 
and a function is in FC iff it is polynomially bounded and its bit graph is in C. (The bit 
graph of a function F : {0, 1}* — > {0, 1}* is the relation Bp(i, X) which holds iff the ith bit 
of F(X) is 1.) 

1.2. The theories V®L and V#L. To construct formal theories for the classes ©L and 
DET we follow the framework laid out in Chapter 9 of the monograph [CN10] of Cook and 
Nguyen for defining theories for certain complexity classes between AC and P. All of these 
theories share a common two-sorted (number and string) vocabulary C\ (see Equation l2,ll) . 
The intention is that the number sort ranges over N and the string sort ranges over bit strings 
(more precisely, finite subsets of N). The strings are intended to be inputs to the machine 
or circuit defining a member of the complexity class, and the numbers are used to index bits 
in the strings. Each theory VC for a class C extends the finitely-axiomatized base theory 
V° for AC by addition of a single axiom stating the existence of a solution to a complete 
problem for C. General techniques are presented for defining a universally-axiomatized 
conservative extension VC of VC which has function symbols and defining axioms for each 
function in FC, and VC admits induction on open formulas in this enriched vocabulary. It 
follows from the Herbrand Theorem that the provably-total functions in VC (and hence in 
VC) are precisely the functions in FC. 

Chapter 9 (with earlier chapters) of |CN10j explicitly defines theories for the following 
classes: 

AC C AC°(2) c TC° C NC 1 CLC NL C NC C P (1.3) 
These classes are defined briefly as follows. A problem in AC is solved by a uniform 
family of polynomial size constant depth Boolean circuits with unbounded fanin AND and 
OR gates. AC (2) properly extends AC by also allowing unbounded fanin parity gates 
(determining whether the inputs have an odd number of l's) in its circuits. TC° allows 
majority gates rather than parity gates in its circuits (and has binary integer multiplication 
as a complete problem). NC 1 circuits restrict all Boolean gates to fanin two, but the circuits 
are allowed to have logarithmic depth. Problems in L and NL are solved respectively by 
deterministic and nondeterministic log space Turing machines. NC is defined like NC 1 , 
but the circuits can have polylogarithmic depth (and polynomial size). Problems in P are 
solved by polynomial time Turing machines. 

Our new theories V@L and V#L for ©L and DET extend the base theory V° for 
AC by adding axioms stating the existence of powers A k of matrices A over Z2 and Z, 
respectively. Here k is presented in unary, but for the case of V#L integer entries for A 
are presented in binary. (Matrix powering is a complete problem for these classes). Here 
there is a technical difficulty of how to nicely state these axioms, since neither the parity 
function (needed to define matrix multiplication over Z2) nor integer product and multiple 
summation (needed to define matrix multiplication over Z) are AC functions, and hence 
neither is definable in the base theory V . We solve this by basing an initial version of 
V(BL on the theory V°(2) for AC (2) (which contains the parity function) and basing an 
initial version of Vj^L on the theory VTC° for TC° (which contains integer product). We 
then use results from [CN10] to translate the axioms for the initial versions to the language 
of the base theory V , to obtain the actual theories V®L and V#L. We show that the 
resulting theories satisfy the requirements of Chapter 9 (existence of "aggregate functions" ) 
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that allow the existence of the nice universal conservative extensions V(BL and V#L of 
V®L and VjfL. Using general results from Chapter 9 of |CN10] we show the following (see 
Theorems 13.41 and 13.71 for more formal statements). 

Theorem 1.1. The provably total functions ofV®L and V@L (resp. V#L and V#L) are 
exactly the functions of the class ©L (resp. DET). Further V®L (resp. VjfL) proves the 
induction scheme for £q '(Cf®l) formulas (resp. Y,q(£f#l) formulas. 

The last sentence means in effect that V®L and V#L prove the induction schemes for 
formulas expressing concepts in their corresponding complexity classes. 

The new theories mesh nicely with the theories for the complexity classes in (jl.3p . In 
particular, we have 

V° C V°(2) C VTC° C VNC 1 C VL C V®L C V#L C VNC C VP (1.4) 

We also have VL C K/VL C U#L. We do not know whether VNL and FffiL are 
comparable, because we do not know whether NL and ©L are comparable. 

1.3. The interpretations. Next we study the question of which results from linear algebra 
can be proved in the theories. As mentioned above, we take advantage of Soltys's theory 
LAp |SK0H ISC04] for formalizing results from linear algebra over an arbitrary field or 
integral domain. We present two interpretations of LAp: one into V®L and one into V#L. 
Both interpretations translate theorems of LAp to theorems in the corresponding theory, 
but the meaning of the theorems differs in the two translations since the ring elements range 
over Z2 in one and over Z in the other. 

LAp defines matrix powering, and uses this definition and Berkowitz's algorithm | Ber84| 
to define several functions of matrices, including determinant, adjoint, and characteristic 
polynomial. The following standard principles of linear algebra are discussed: 

(i) The Cayley-Hamilton Theorem (a matrix satisfies its characteristic polynomial). 

(ii) The axiomatic definition of the determinant (the function det(A) is characterized by 
the properties that it is multilinear and alternating in the rows and columns of A, and 
det(I) = 1). 

(iii) The co- factor expansion of the determinant. 

Although it remains open whether LAp can prove any of these, a major result from 
[SKOll ISC04] is that LAp proves their pairwise equivalence. As a result of this and our 
interpretations (Theorems 14.11 and 14. 3 p we have the following. 

Theorem 1.2. V®L proves the equivalence of (i), (ii), and (iii) over the ring Z2, and 
Vj^L proves their equivalence over Z. 

An intriguing possibility is that either V@L or Vf^L could use special properties of Z2 or 
Z to prove its version of the principles, but LAp cannot prove them (for all integral domains 
or fields). For example there is a dynamic programming algorithm involving combinatorial 
graph properties (see the concluding Section [5]) whose correctness for Z might be provable 
in VjfL using combinatorial reasoning with concepts from which are not available in 
LAp^ 

[SK01, SC04] also present the so-called hard matrix identities: 
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Definition 1.3. The hard matrix identities are 

AB = I, AC = I -> B = C 

AB = I, AC = -)■ C = 

AB = I ^ BA = I ( L5 ) 
AB = I ^ A t B t = I 

where A, £?, C are square matrices of the same dimensions, and A 1 is the transpose of A. 

Again it is open whether LAp proves these identities, but LAp does prove that they 
follow from any of the principles mentioned in Theorem 11.21 above . The next result follows 
from this and our interpretations (Theorems 14.11 and 14. 3|) . 

Theorem 1.4. V®L proves that ( fi.5j) over the ring 7*2 follows from any of the three 
principles mentioned in Theorem The same is true for V#L over the ring Z. 

[SK01, SC04J introduce an extension VLAp of LAp, which includes an induction rule 
that applies to formulas with bounded universally quantified matrix variables, and show that 
the three principles mentioned in Theorem [L2] and the four matrix identities are all provable 
in VLAp. These papers claim that these proofs in VLAp translate into proofs in the theory 
V 1 for polynomial time (V 1 extends VP in equation (II. 4|) ) when the underlying ring is finite 
or Q. However Jefabek [Jef05] (page 44) points out that for infinite rings this is not true, 
because the definition given of bounded universal matrix quantifiers only bounds the number 
of rows and columns, and not the size of the entries. To fix this, [Jef05] defines a subsystem 
VLAp - of VLAp, with properly defined bounded universal matrix quantifiers, which still 
proves the three principles mentioned in Theorem 11.21 and the four matrix identities, and 
shows that these proofs translate into proofs in V 1 when the underlying ring is finite or 
Q (and hence also Z). Since V 1 is conservative over VP for universal theorems involving 
polynomial time functions we have the following result. 

Proposition 1.5. |SK01| ISC041 |Je~f05| The theory VP proves the three principles (i), (ii), 
(Hi) and the matrix identity tl.5\) for both the rings Z2 and 7L. 



2. Two-Sorted Theories 

We start by reviewing the two-sorted logic used here and in |CN10| . We have number vari- 
ables x, y, z, . . . whose intended values are numbers (in N), and string variables X, Y, Z, . . . 
whose intended values are finite sets of numbers. We think of the finite sets as binary strings 
giving the characteristic vectors of the sets. For example the string corresponding to the 
set {0,3,4} is 10011. 

All our two-sorted theories include the basic vocabulary C\, which extends the first- 
order vocabulary of Peano Arithmetic as follows: 

C\ = [0,1,+,-, I |, e,<,=i, =2] (2.1) 

The symbols 0, 1, +, • are intended to take their usual meanings on N. Here | | is a function 
from strings to numbers, and the intended meaning of \X\ is 1 plus the largest element of 
X, or if A is empty. (If X = {0,3,4} then \X\ = 5.) The binary predicate € is intended 
to denote set membership. We often write X(t) for t £ X (think that bit number t of the 
string X is 1). The equality predicates =1 and =2 are for numbers and strings, respectively. 
We will write = for both, since the missing subscript will be clear from the context. 
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Number terms (such as x + ((|X| + 1) • |1^|)) are built from variables and function 
symbols as usual. The only string terms based on C 2 A are string variables X, Y, Z, . . ., but 
when we extend C\ by adding string- valued functions, other string terms will be built as 
usual. Formulas are built from atomic formulas (e.g. t = u,t < u,X(t),X = Y) using 
A, V, -. and 3x, Vx, 3X, MX. 

Bounded quantifiers are defined as usual, except bounds on string quantifiers refer to 
the length of the string. For example 3X<t <p stands for 3X(|X|<i A (p). 

We define two important syntactic classes of formulas. 

Definition 2.1. T>q is the class of C\ formulas with no string quantifiers, and only bounded 
number quantifiers. £f formulas are those of the form 3X<t cp, where tp is in Hq and the 
prefix of bounded quantifiers may be empty. 

Notice our nonstandard requirement that the string quantifiers in Y,f formulas must be 
in front. 

We also consider two-sorted vocabularies C D C 2 A which extend C 2 A by possibly adding 
predicate symbols P,Q,R, . . . and function symbols f,g,h,... and F,G, H, . . .. Here /, g,h, . . 
are number functions and are intended to take values in N, and F,G, H, . . . are string func- 
tions and are intended to take string values. Each predicate or function symbol has a 
specified arity (n, m) indicating that it takes n number arguments and m string arguments. 
Number arguments are written before string arguments, as in 

f(x 1 ,...,x n ,Xi,...,X m ) F(xi, . . . ,x n ,X 1 , . . . ,X m ) (2.2) 

The formula classes Y,q(£) and Hf(C) are defined in the same way as Y,q and Sf , but 
allow function and relation symbols from C in addition to C\. 

2.1. Two-sorted complexity classes. In standard complexity theory an element of a 
complexity class is either a set of binary strings or a function / : {0,1}* — > {0,1}*. In 
our two-sorted point of view (Chapter 4 of [CN10]) it is convenient to replace a set of 
strings by a relation R(x,X) of any arity (n,m), and functions are generalized to allow 
both number functions and string functions as in (I2.2p . Each standard complexity class, 
including those in (jl.3p and ©L and #L, is defined either in terms of Turing machines or 
circuit families. These definitions naturally extend to two-sorted versions by representing 
strings (as inputs to machines or circuits) in a straightforward way as binary strings, but by 
representing numbers using unary notation. This interpretation of numbers is a convenience, 
and is justified by our intention that numbers are 'small' and are used to index strings and 
measure their length. 

For example, the (two-sorted) complexity class P (resp. NP) is the set of relations 
R(x, X) recognized by polynomial time (resp. nondeterministic polynomial time) Turing 
machines, with the above input conventions. Thus the relation Prime\{x) (x is a prime 
number) is trivially in P since there are at most x possible divisors to test, and the testing 
can obviously be done in time polynomial in x. The relation Prime2(X) (the number whose 
binary notation is X is prime) is also in P, but this is a major result [AKS04], since the 
testing must be done in time polynomial in the length \X\ of X. 

The class (uniform) AC can be defined in terms of uniform polynomial size constant 
depth circuit families, but it has a nice characterization as those sets recognized by an 
alternating Turing machine (ATM) in log time with a constant number of alternations. 
More useful for us, |Imm99| showed that an element of AC can be described as an element 
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of FO, namely the set of finite models of some first-order formula with a certain vocabulary. 
From this and the ATM definition of two-sorted AC , we have the following important 
results relating syntax and semantics. 

Proposition 2.2. [Representation Theorems] (IV. 3. 6 and IV. 3. 7 of |CN10| ) A relation 
P(x,X) is in AC (respectively NP) iff it is represented by some T,q -formula (respectively 
Ylf -formula) tp(x,X). 

For example the relation PAL(X) (X is a palindrome) is an AC relation because the 
S^-formula Vx,y<|X| (x + y + 1 = \X\ D (X(x) o X(y))) represents it. 

2.2. Special functions. There are several conventional number and string functions in 
FAC° used to encode and retrieve information from bit-strings. Their usage is crucial to 
several technical proofs below, so we here provide definitions of common functions from 
[CN10| as well as extensions useful for our own purposes. 

The pairing function (x,y) = (x + y) (x + y + 1) + 2y allows easy 2-dimensional indexing 
in strings. It can be extended to fe-arity. 

(x 1 ,x 2 , ...,x k ) = (x 1 , (x 2 , x k )} 

We construct the unary relation Pair(z) to be true only of numbers in the range of the 
pairing function. The functions left((x,y)) = x and right ((x,y)) = y reverse the pairing 
function, and are defined to be on numbers z where Pair(z) is false. 

For example a /c-dimensional bit array can be encoded by a string X, with bits recovered 
using the pairing function. By convention, we write: 

X{x\, ...,x k ) = X({xi,. . .,x k )) 

For 2-dimensional arrays, the x th row can be recovered using the Row function, which 
is bit-defined: 

Row(x, Z)(i) <-> i < \Z\ A Z(x, i) 
For notational convenience, we write Row(x,Z) = Z^ X K Thus a 1-dimensional array of j 
strings X\, . , , ,Xj can be encoded in a single bit-string Z, where X{ = Z^. 

It will be useful when dealing with integers (section I3.2|) to be able to encode 2- 
dimensional arrays of strings X% j as a single string Z . To do so, we first encode each row of 

the 2-dimensional matrix of strings as a 1-dimensional array of strings Yj so that K = Xij. 
Then we encode a 1-dimensional array of those arrays: = Y{. The x,y th strings in this 
2-dimensional array Z is recovered with the R0W2 function: Row2(x,y, Z) = Z^ y \ 

Row 2 {x,y,Z)(i) o % < \Z\ A Row(x, Z)(y,i) (2.3) 

The Row and R0W2 functions allow a bit-string Z to encode a list or matrix of other 
bit-strings. We have analogous functions seq and entry that enable string Z to encode a 
list or matrix of numbers (respectively). Thus the numbers yo, y%, y 2 , ■ ■ .encoded in Z can 
be recovered yi = seq(i, Z) = (Z) 1 by convention. 

y = seq(x, Z) <H- 

(y < \Z\ A Z(x, y) A Mz < y,->Z(x,z)) V (V* < \Z\, ->Z(x, z) A y = \Z\) (2.4) 
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Numbers mj are encoded in Z by first arranging them into rows Yi where (Yi) 3 = seq(j, Yi). 
Then Yi = Z^\ The numbers are retrieved using the entry function: y^j = entry(i,j, Z). 

entry Z) = y o (Z^) 3 ' = y 
2.3. The classes ®L and DET. 

Definition 2.3. ®L is the set of relations R(x, X) such that there is a nondeterministic log 
space Turing machine M such that M with input x, X (represented as described in Section 
12. ID has an odd number of accepting computations iff R(x, X) holds. 

#L is the set of string functions F(x, X) such that there exists nondeterministic log 
space Turing machine M, and F(x, X) is the number (in binary) of accepting computations 
of M with input x, X. 

In the above definition we restrict attention to log space Turing machines that halt for 
all computations on all inputs. We think of the binary string F(x, X) as a number, as given 
by the following definition. 

Definition 2.4. A string X represents the number bin(X) if 

bin(X) = ^2 i X(i) 

i 

where we treat the predicate X(i) as a 0-1 valued function. 

We now formalize the correspondence (mentioned in Section [Vj between a complexity 
class C of relations and a complexity class FC of functions. A number function f(x,X) 
(respectively string function F(x, X)) is p-bounded if there is a polynomial g(x, y) such that 
f(x,X) < g(x, \X\) (respectively \F(x,X)\ < g(x, |A"|)). The bit graph of a string function 
F is the relation Bp defined by Bp(i, x, X) F(x, X)(i). 

Definition 2.5. If C is a class of (two-sorted) relations then FC denotes the corresponding 
class of functions, where / (respectively F) is in FC iff it is p-bounded and its graph 
(respectively bit graph) is in C. If C is a two-sorted complexity class of functions, then RC 
consists of all relations whose characteristic functions are in C. 

In general when we refer to a complexity class such as AC or P we refer to the relations 
in the class, and sometimes also to the functions in the corresponding function classes FAC° 
and FP. 

We will consider two-sorted vocabularies C which extend and in all cases each 
function and relation symbol in C has a specific intended interpretation in our standard 
two-sorted model (the two universes being N and the set of finite subsets of N). Thus we 
can make sense of both syntactic and semantic statements about C 

If £ is a two-sorted vocabulary, then / (respectively F) is S^-definable from £ if it is 
p-bounded and its graph (respectively bit graph) is represented by a formula in Eq (£). In 
this case / (or F) can be computed by a uniform polynomial size family of bounded-depth 
circuits with oracle access to the functions and predicates in C 

The following definition is from page 269 of [CN10 . 
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Definition 2.6 (^4C°-reducibility). (IX. 1.1 in [CN10| ) A string function F (respectively, a 
number function /) is ^4C°-reducible to £ if there is a sequence of string functions Fx, . . . , 
F n , (n > 0) such that 

F{ is X^-definable from C U {Fi, . . . , F^i} for i = 1, . . . , n; 

and F (resp. /) is E^-definable from L U {Fi, . . . ,F n }. A relation R is ^4C°-reducible to 
C if there is a sequence of string functions Fx, . . . , F n as above, and R is represented by a 
(£ LI {Fx,..., F n })-formula. 

AC°(C) (the AC closure of C) denotes the closure of £ under ^4C°-reducibility. 

It is easy to see from the above definitions and Proposition 12.21 that AC°(C\) is the set 
of all AC functions and relations. 

From Lemma 6 in [BDHM92] we know that ©L is closed under NC 1 reductions, and 
hence also under the weaker AC reductions, so AC°(®L) = ®L. However we do not know 
whether j^L is closed under AC reductions [A096]. Since our theories can only characterize 
classes closed under AC reductions, we use the class DET instead of #L. 

Definition 2.7. DET = AC°(#L) 

Matrix powering is central to our theories for ©L and DET. So we need to define 
this for each of the two classes as a two-sorted function. For ©L we use the truth values 
{false, true} to represent the elements {0, 1} of Z2, and we represent a matrix over Z2 with 
a string X. For DET we represent integers with binary notation by bit strings (using 
Definition I2.4p , and we represent a matrix over Z by an array of strings (see Section \2.2\i . 
We number rows and columns starting with 0, so if X is an n x n matrix then < i, j < n 
for all entries X(i,j). 

Definition 2.8 (Matrix Powering). Let X be a string representing annxn matrix over Z2 
(resp. Z). Then the string function Poui2{n,k,X) (resp. Powz(n,k,X)) has output X h , 
the string representing the /c th power of the same matrix. 

Proposition 2.9. ©L = AC°(Pow 2 ) and DET = AC°(Pow z ). 

Proof. For DET, by Definition [277] it suffices to show AC°(Pow z ) = AC°(#L), and this 
is proved in [Fon09j. A proof can also be extracted from the earlier literature as follows. 
Let IntDet(n, X) be the function that returns the determinant of the n x n integer matrix 
X. By Berkowitz's algorithm |Ber841 ISC04] . IntDet is AC reducible to Powz, and by a 
standard reduction (see for example the proof of Proposition 5.2 in |Coo85j ) Powz is AC 
reducible to IntDet. Hence it remains to show that AC (IntDet) = AC°(#L), and this is 
stated as Corollary 21 in [AD 96J. A key idea is to show that the problem of counting the 
number of paths of length at most a parameter p from nodes s to t in a directed graph is 
complete for #L. 

The fact that ©L = AC°(Pow 2 ) follows from Theorem 10 in |BDHM92j . (That theorem 
is stated in terms of NC 1 reductions, but the needed reductions are easily shown to be AC .) 

□ 

Although Powz(n,k,X) allows the entries of the matrix X to be arbitrary binary 
integers, it turns out that the function is still complete for DET when the argument X 
is restricted to matrices with entries in 0, 1 (see for example the proof of Lemma 16 in 
|Fon09| K 
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2.4. The Theories V°, V°(2), and VTC°. The the ory V° for AC is the basis for ev- 
ery two-sorted theory considered here and in [CN10| . It has the two-sorted vocabulary 
C\, and is axiomatized by the set 2-BASIC (Figure [1]) of axioms consisting of 15 Eq 3 for- 
mulas expressing basic properties of the symbols of C\, together with the following 
comprehension scheme. 

S^-COMP : 3X<y Vz<y (X(z) o <p(z)) 
Here f(z) is any formula with no free occurrence of X. 

Bl. x + B2. x + l = y + lDx = y 

B3. x + = x B4. x + (y + 1) = (x + y) + 1 

B5. x-0 = B6. x ■ (y + 1) = (x ■ y) + x 

B7. (x < y Ay < x) D x = y B8. x < x + y 

B9. < x BIO. x < y Vy < x 

Bll. x <y o x < y + 1 B12. i ^ D 3j/ < x(y + 1 = x) 

LI. X(y)Dy<\X\ L2. y + 1 = |X| D X(y) 
SE. (|X| = \Y\ A Vi < |X|(X(z) o d X = Y 

Figure 1: 2-BASIC axioms 

V° has no explicit induction axiom, but nevertheless the induction scheme 
Sf-IND : (p(0) A \/x(ip(x) D <p(x + 1))) D Vzi^(z) 

for formulas v , ( a; ) is provable in 1/°, using S^-COMP and the fact that \X\ produces 
the maximum element of the set X. 

Definition 2.10. A string function F(x,X) is T>f -definable (or provably total) in a two- 
sorted theory T if there is a Tif formula tp(x, X, Y) representing the graph Y = F(x, X) of 
F such that T h \/x\/X 3\Y <p{x, X,Y). Similarly for a number function f(x,X). 

It is shown in Chapter 5 of [CN10| that V° is finitely axiomatizable, and the In- 
definable functions in V comprise the class FAC° (see Definition 12.51) . 

The definition in |CN10| of the theory V°(2) for the class AC (2) is based on y° and 
an axiom showing the definability of the function Parity(x,Y). If Z = Parity(x,Y) then 
Z(z) holds iff 1 < z < x and there is an odd number of ones in Y(0)Y(1) . . . Y(z — 1). The 
graph of Parity is defined by the following formula: 

Sparity(x, Y, Z) = -.Z(O) A Vz<x (Z(z + 1) O (Z(z) Y(z))) 

Definition 2.11. [CNlOj The theory V°(2) has vocabulary C\ and axioms those of V° and 
3Z<X+l 5 p ar ity (x,Y,Z). 

The complexity class FAC°(2) is the AC closure of the function Parity(x,Y), and in 
fact the S^-definable functions of V°(2) are precisely those in FAC°(2). 

The theory VTC° for the counting class TC° is defined similarly to V (2), but now 
the function Parity(x,Y) is replaced by the function numones(y, X), whose value is the 
number of elements (i.e. 'ones') of X that are less than y. The axiom for VTC° is based 
on a T> B formula Snum(u, X, Z) defining the graph of a string function accumulating the 
values of numones(y, X) as y increases. 
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Definition 2.12. [CNlOj The theory VTC° has vocabulary C\ and axioms those of V 
and 3Z<l+{y,y) 5 NUM (y,X,Z). 

The class FTC is the AC closure of the function numones, and in fact the In- 
definable functions of VTC° are precisely those in FTC . 

In Chapters 5 and 9 of [CNlOj it is shown that the theories V°, V°(2), VTC° have 
respective universally axiomatized conservative extensions V°, V°(2), VTC° obtained by 
introducing function symbols and their defining axioms for all string functions (and some 
number functions) in the corresponding complexity class. These have the following proper- 
ties. 

Prop osition 2.13. Let (FC, V,V) be any of the triples (FAC°, V°,V°~) or (FAC°(2), 
V°(2),V°(2)) or (FTC°,VTC°,VTC°), and let C be the vocabulary ofV. 

(i) V is a universally axiomatized conservative extension of V , 

(ii) the T,f -definable functions of both V and V are those in FC , 

(iii) a string function (respectively number function) is in FC iff it has a function symbol 
(respectively term) in C, 

(iv) V proves the (£)-IND and £^(£)-COMP schemes, 

(v) for every Y,f(C) formula ip + there is a T,f formula ip such that V h <p + o (p. □ 

3. The New Theories 

When developing theories for ©L and jfL it will be more convenient to work first with ©L, 
where each number can be represented by a single bit, before adding the complication of 
multi-bit numbers in #L. 

3.1. The Theory V@L. The theory V®L is an extension of V°(2) (Definition EjT]) ob- 
tained by adding an axiom showing the existence of matrix powering over Z2. We use the 
fact that ©L is the AC closure of this matrix powering function (Proposition I2.9| ) and the 
development in Chapter 9 of |CN10] to show that the provably total functions in V®L are 
exactly the set FffiL. 

The actual method followed below of describing the theory V®L is slightly more compli- 
cated. In order to obtain the desired axiom for matrix powering (i.e., the function P0W2 in 
Definition 12. 8p . we will work with the theory V°(2), a conservative extension of V°(2), and 
its vocabulary £>fac (2)i wn i cn has function symbols or terms for every function in FAC°(2) 
(Proposition 12. 13]) . We describe the Ef axiom for matrix powering as a Sf {Cfac°(2)) for- 
mula and refer to part (v) of proposition 12.131 to conclude that this formula is provably 
equivalent to a T,f(C\) formula. Hence we will freely use FAC° functions and the function 
Parity (x,Y) when describing formulas which will help express the axiom. 

Each string function F(x,X) (other than Parity{x,Y)) in the vocabulary of V°(2) has 
a defining axiom specifying its bit graph as follows: 

F(x, X)(z) -H- z < t(x, X) A cp(z, x, X) 

where t is an C\ term bounding the function and the formula tp is in the previously 
defined symbols. 
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In order for a formula to encode matrix powering, it will need to include witnesses - 
enough information to "check" that the matrix has been powered correctly. The function 
PowSeq 2 computes every entry of every power of matrix X up to the k th power. 

Definition 3.1 (PowSeq 2 ). Let X be a string representing an n x n matrix over 7* 2 , and let 
X 1 be the string representing the i th power of the same matrix. Then the string function 
PowSeq 2 (n, k, X) has output the list of matrices [ID(n),X, X 2 , . . . , X }, coded as described 
in Section [2.2i 

Note that the functions Pow 2 and PowSeq 2 are ^4C°-reducible to each other, so both 
are complete for F@L. Our axiom for V@L states the existence of values for PowSeq 2 . 

We start by defining the matrix product operation. The AC string function ID(n) 
codes the n x n identity matrix, and has the defining axiom (recall Section [2. 2\i 

ID{n)(b) O b < (n,n) A Pair(b) A left(b) = right(b) 

For X and Y encoding two matrices, the AC string function G(n, i, j, X, Y) codes the 
string of pairwise bit products of row i of X and column j of Y: 

G(n,i,j,X,Y)(b) <-> b < n A X(i, b) A Y(b,j) 

We define 

PAR(X) = Parity (\X\,X)(\X\) (3.1) 
so that PAR(X) holds iff X has an odd number of ones. Thus the function Prod 2 (n, X, Y) 
producing the product of two n x n matrices X and Y over Z 2 , has bit graph axiom: 

Prod 2 (n,X,Y)(b) <H- b < (n, n) A Pair(b) A left(b) < n A right (b) < n 

APAR(G(n, left(b), right(b),X, Y)) (3.2) 
This yields a Hq {Cfac°(2)) formula Sp ow seq 2 ( n 7 k, X, Y) for the graph of PowSeq 2 (n, k, X, Y): 

Y [0] = ID{n)AVi<k (Y [i+1] = Prod 2 {n, X,Y^) (3.3) 
AV6<|y| (Y(b) d (Pair (6) A kft(b) < n)) 

The second line ensures that Y is uniquely defined by specifying that all bits not used in 
encoding matrix entries must be false. Note that 5p wSeq 2 involves the function Prod 2 and 
hence is not equivalent to a formula, but by part (v) of Proposition 12.131 V°(2) proves 
it is equivalent to a Sf formula S' PowSeq2 . 

Definition 3.2. The theory V®L has vocabulary C\ and axioms those of V°{2) and the 
Ef formula 3Y<l+(k, (n, n)) 5' PowSeq {n, k, X, Y) stating the existence of a string value for 
the function PowSeq 2 (n, k, X). 

Following the guidelines of Section IX. 2 of [CNlflj, we prove that several functions, 
including the aggregate function PowSeq 2 (see (|3.5p and (|3.6p below), are Sf -definable in 
V(BL. 

Lemma 3.3. The functions Pow 2 , PowSeq 2 , and PowSeq 2 are Y<f -definable (definition 
UTTW in V@L. 

Proof. Since V°(2) is a conservative extension of V°(2), it follows that the theory 

T = V@L + F°(2) (3.4) 
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is a conservative extension of V(BL, and this allows us to reason in T to make inferences 
about the power of V(BL. Thus by part (v) of Proposition 12.131 to prove that a function is 
Sf -definable in V®L it suffices to prove that it is Sf (£_p J 4 C o( 2 ))-definable in T ■ 

For the function PowSeq 2 the graph is given by either of the equivalent formulas 5 p ow Seq 2 
or S' PowSeq . Existence of the function value follows immediately from the axiom given in 
Definition 13.21 and uniqueness is proved by induction (justified by part (iv) of Proposition 
I2.13P on the Sq (£fac°(2)) formula stating that the first i bits of the function's output are 
unique. 

For Pow 2 the function value can be extracted from the value for PowSeq 2 by an AC 
function, and existence and uniqueness of that value follows from existence and uniqueness 
for PowSeq 2 . 

Recall from chapter 8 of [CN10] (Definition VIII. 1.9) that the aggregate function 
PowSeq 2 is the polynomially bounded string function that satisfies 

\PowSeq*{b,W l ,W 2 ,X)\ < (b, (\W 2 \, |Wi|»> (3.5) 

and 

PowSeq* 2 (b,W 1 ,W 2 ,X)(i,v) ^ i < b A PowSeq 2 ((W{f , (W 2 y , X^)(v) (3.6) 
The strings W\, W 2 , and X encode 6- length lists of inputs to each place of PowSeq 2 : W\ en- 
codes the list of numbers no, ni,n 2 , . . . , n&_i; W 2 encodes the list of numbers ko,ki,..., &&-i; 
and X encodes the list of strings Xq, X\ , . . . , -X&_i . Our goal is to raise the rii x m matrix en- 
coded in Xi to the powers 1,2, ... ,ki. The string function PowSeq 2 computes all these lists 
of powers, aggregating many applications of the function PowSeq 2 . This is accomplished by 
forming a large matrix S by placing padded versions of the matrices Xq,X\, . . . , Xb_± down 
its diagonal, and noting that the ith power of Xj can be extracted from the ith power of S. 
The padding is necessary to make the diagonal matrices all the same size, so the location of 
Xj in S can be computed with AC (2) functions. The details of a formula for the graph of 
PowSeq 2 and the proof that T can prove its unique output exists can be found in appendix 

m □ 

Section IX. 2 in |CN10| presents a general method for defining a universal conservative 
extension VC (satisfying the properties of Proposition 12.131) of a theory V C over L\. Here 
VC is assumed to be defined in a manner similar to V (2) and VTC°; namely by adding 
a Tjf axiom to V° showing the existence of a complete function Fq (and its aggregate F*) 
for the complexity class C. Although our new theory V®L fits this pattern, for the purpose 
of defining V®L it is easier to start from the conservative extension T of V®L defined in 

(HID. 

We now view T as the extension of V°(2) obtained by adding the axiom 

3Y<l+(k, (n, n)) 6 PowSeq2 (n, k, X, Y) 
where 5p ow s e q 2 is the original version of S' PowSeq given in (|3.3p , (Note that this axiom is 
equivalent to the one in Definition 13.21 ) 

A small modification to the development in Section IX. 2 generalizes it to the case in 
which the base theory is an extension V of V° rather than just V° (in this case V = T). 
The construction of VC works so that Proposition 12.131 holds, where now the complexity 
class C is the AC closure of {C,F}, where L is the vocabulary of V and F is the func- 
tion whose existence follows from the axiom. In the present case, this allows us to define 
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V®L satisfying Proposition 12.131 where now the complexity class C is the AC closure of 
{Afac°(2)> PowSeq 2 }, which is same as the AC closure of {PowSeq 2 }, namely ©L. 

To start this construction we need a quantifier-free axiomatization of T = V(BL+V°(2). 
This consists of the axioms for V°(2) (which are quantifier- free) together with a quantifier- 
free defining axiom for PowSeq 2 . The formula (I3.3P for b~p ow s e q 2 (n, k, X, Y) (the graph of 
PowSeq 2 ) has bounded number quantifiers, but these may be eliminated using functions in 
FAC°(2) (see Section V.6 of |CN10| ). Thus 5p ow Seq 2 {n,k,X,Y) is provably equivalent to 

a quantifier- free formula 5 PowSeq 2 

(n,k,X,Y) over the vocabulary C F ac°(2) f° r V"°(2). The 
required quantifier-free defining axiom is 

Y = PowSeq 2 (n,k,X,Y) O 5" PowS eq 2 (n, k, X, Y) 

As a result of this development we have a version of Proposition 12.131 for V@L: 

Theorem 3.4. Let Cf®l be the vocabulary ofV@L. 

(i) V®L is a universal conservative extension of V®L, 

(ii) the T,f ' -definable functions of both V®L and V@L are those of F®L, 

(iii) a string function (respectively, number function) is in F®L iff it has a function symbol 
(resp., term) in Cf®l, 

(iv) V®L proves the E^(£f ffl i)-IND and T<q (£f®l)-CO~M.P schemes, and 

(v) for every T,f \Cf®l) formula ip + there is a £f formula cp such that V®L h ip + f-> (p. □ 

3.2. The Theory V#L. Our theory VjfL is associated with the class DET = AC°(#L) 
(Definition I2.T|) . Just as V®L is an extension of V°(2), VffL is the extension of VTC° 
(Definition 12. 12|) obtained by adding an axiom showing the existence of matrix powering for 
matrices with binary integer entries. We use the fact that DET is the AC closure of this 
matrix powering function (Proposition 12. 9| ) and the development in Chapter 9 of [CN10| to 
show that the provably total functions in V#L are exactly the set FDET. 

The development of V#L is similar to that of V(BL, but has extra complications. Before 
defining matrix multiplication and powering we must define multiplication and iterated sum 
for binary integers. Both of these functions are complete for the complexity class TC°, so we 
define V#L as an extension of the theory VTC° (recall that V(BL is defined as an extension 
of V°(2)). To define the required functions we work in the conservative extension VTC° 
of VTC° (just as we worked in the conservative extension V°(2) of V°(2) when developing 
V®L). 

We represent integers by strings. For a string X encoding an integer x, the first bit 
X(0) indicates the sign of x: x < iff -^(0). The rest of string X is a binary representation 
of x, from least to most significant bit. The magnitude of an integer can be extracted with 
the FAC° function intsize(X), which simply deletes the low order bit of the string X. Thus 
an integer can be recovered from its string encoding by computing • intsize(X). 

The theory V° defines binary addition X +n Y over natural numbers (Example V.2.5 
in [CN10| ). and VTC° defines binary multiplication X Y over natural numbers and 
Sumf$(n, X) (the sum of the list jl ',..., j'"" 1 ! of natural numbers) (Section IX. 3. 6 in 
[CNlOj ). These functions over N can be used to help develop quantifier- free formulas over 
C F tc° which define the corresponding functions +z> Xj, and Sum% over Z. For example 
the function xpj can be used to define the graph R Xz (X, Y, Z) by 

intsize(Z) = intsize(X) x N intsize(Y) A (Z(0) O (X(0) © Y(Q))) 
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The definition of integer addition +1 is more complicated and requires defining the 'borrow' 
relation for subtraction as well as using +pj. The definition of Sumz(n, X) (iterated integer 
sum) splits the list X into a list of positive integers and a list of negative integers, sums 
each using Sum?q, and subtracts the second sum from the first. 

Matrices of integers are represented by arrays of strings using the R0W2 function (|2.3p . 
Integer matrix product is the TC° function Prod%{n, X, Y) defined analogously to formula 
(|3.2p for Prod,2, and the graph of iterated matrix product is defined by the T,q \C ftc o) 
formula 5p 0W s e q z (n,k,X,Y) analogous to 5p ow s e q 2 ()3.3[) . (See [Fon09j) for more details.) 

By part (v) of Theorem 12.131 VTC° proves 5p ow s e q z is equivalent to a Xf formula 
^PowSeq z i which we use in the following axiom for V#L. 

Definition 3.5. The theory Vj^L has vocabulary C 2 A and axioms those of VTC° and the 
T,f formula 3Y<t o~p ow s e q z ( n > ^> X, Y) for a suitable bounding term t. 

The next two results are proved similarly to Lemma 13.31 and Theorem 13.41 

Lemma 3.6. The functions Powz, PowSeq z , and PowSeq z are T,f -definable in V#L. □ 

Theorem 3.7. Let Cf#l be the vocabulary ofV#L. 

(i) VffL is a universal conservative extension of V#L, 

(ii) the T,f -definable functions of both V#L and V#L are those of FDET, 

(iii) a string function (respectively, number function) is in FDET iff it has a function 
symbol (resp., term) in Cf#l, 

(iv) V#L proves the £^(£p #L )-IND and E^(£ F#L )-COMP schemes, and 

(v) for every Sf (Cf#l) formula (p + there is a T,f formula ip such that VjfL h ip + H^.D 



4. Interpretations of LAp 

Having established the theories V®L and V#L, we are interested in studying which results 
from linear algebra are provable in these theories. This job is made much easier by taking 
advantage of Soltys' theory LAp [SK01} ISC04] . which formalizes linear algebra over an 
arbitrary field (or integral domain) . As explained in Section [TJ this theory defines standard 
matrix functions such as determinant, adjoint, and characteristic polynomial, in terms of 
matrix powering, and formalizes relative proofs of their properties. We show that V®L 
and V#L prove these same properties by interpreting LAp into each of them. The two 
interpretations are different because the intended semantics are different: the underlying 
rings are respectively Z2 and 7L. 

Now Theorems 11.21 and 11.41 in Section [1] follow from the fact that each interpretation 
translates theorems of LAp into theorems in the theory. Actually there are exceptions to 
this preservation in the case of V#L, because the underlying ring Z is not a field. However 
as explained at the beginning of Section \4. 3\ Theorems II .21 and II. 41 do indeed hold for Vf^L. 

4.1. Denning LAp. Soltys' theory LAp (for Linear Algebra with matrix Powering) is a 
quantifier-free theory based on Gentzen-style sequents. It has three sorts: indices (i.e. 
natural numbers) (represented by i, j, k), field elements (a, b, c), and matrices (A, B, C) 
with entries from the field. 
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The language £lAp of LAP has symbols 

Oindex, lindex, "hndex, *index, index' rem, r, C, 

Ofield, Ifield, +field, *field, —field, ~ , X^J 

lindex, — index, — field, — matrix, COIldindcx, COndfield; P 

The intended meanings of 0, 1, +,*, , and —field are obvious. The symbol — index is 
cutoff subtraction; div(i, j) and rem(i,j) are the quotient and remainder functions; t(A) 
and c(A) return the numbers of rows and columns in A; e(A,i,j) is the field element Ay, 
(where Ay- = if i = or j = or i > r(A) or j > c(A)), ^2(A) is the sum of all the entries 
of A; and for a a formula, condi n dex(«, h j) is £ if a is true and j otherwise (similarly for 
cond ne id(a, a, b)). The powering function p(n, A) = A n . 

Terms and quantifier-free formulas are mostly constructed in the usual way, respecting 
types. We use n, m for index terms, t, u for field terms, T, U for matrix terms, and a, (3 for 
(quantifier-free) formulas. The four kinds of atomic formulas are m <mdex n , m =index n, 
t =fieid u, and T = ma trix U. Formulas are built from atomic formulas using A, V,->. There 
are restrictions on terms beginning with cond: If a is a formula with atomic subformulas 
all of the form m <mdex n and m =i n dex n, then condi n dex(o, m', n') is a term of type index 
and condgeid(«, t, u) is a term of type field. 

A term of type matrix is either a variable (A, B,C,- ■ • ) or a lambda term of the form 
Xij (to, n, t) (with the restriction that i and j are not free in the index terms m and n). This 
lambda term defines an m x n matrix with (i,j) th entry given by t(i,j). 

Lines in an LAp proof are Gentzen-style sequents a\,...,ak — > f3%, ■ ■ ■ , Pi with the 
usual meaning f\cti Z> V /3j . The logical axioms and rules are those of Gentzen's system LK 
(minus the quantifier rules). The nonlogical axioms are numbered Al through A36 (see 
Appendix [B]) . 

There are two nonlogical rules. The first nonlogical rule is the Induction Rule: 

r,q(z) -> q(i + 1), A 
T,a(0) -> a(n), A 1 ' ' 

The second nonlogical rule is the Matrix Equality Rule, which states that two matrices are 
equal if they have the same numbers of rows and columns, and have equal entries: 

T^A,e(T,i,j)=e(U,i,j) 
r -> A,r(T) = r(U) (4.2) 



c c c "5*1 

*2 ^3 , a 

where 02 



r -> A, T - matrix U 5 3 : T ^ A, c(T) = c(U) 

Many matrix functions such as multiplication, addition, transpose, can be defined using 
A terms, avoiding the need for separately defined function symbols. We use the following 
abbreviations for defined terms. 

Integer maximum max{i, j} := cond(i < j,j,i) 

Matrix sum A + B := Xij (max{r( A), v(B)}, max{c(A), c(B)}, A {j + By) 

Note that A + B is well defined even if A and B are incompatible in size, because of 
the convention that out-of-bound entries are 0. 

Scalar product aA := \ij(r(A),c(A),a * A^) 

Matrix transpose A 1 := Xij(c(A),r(A),Aji) 

Zero and Identity matrices 

Ofcz := Xij(k, 1,0) and I k := Xij{k, k, cond(i = j, 1, 0)) 
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Sometimes we will just write and / when the sizes are clear from the context. 
Matrix trace tr(A) := T,\ij(r(A), I, An) 

Dot product A - B := £Aij(max{r(yl), r(B)}, max{c(>l), c(B)}, Ay * By) 
Matrix product 

A * B := \ij{T(A), c(B), \kl(c(A), 1, entry(A, i, k)) ■ XM(t(B), 1, entry(B, k,j))) 

Matrix product is defined even when c(A) ^ r(-B), again by the convention that out- 
of-bound entries are 0. 

Finally, the following decomposition of an n x n matrix A is used in the axioms defining 
T,(S) and in presenting Berkowitz's algorithm: 

A =( a s m) (") 

where an is the (1, 1) entry of A, and R, S are 1 x (n— 1), (n— 1) x 1 submatrices, respectively, 
and M is the principal submatrix of A Therefore, we make the following precise definitions: 

R(A) := Xij{l, c(A) - 1, entry(A, 1, i + 1)} 

S(A) := \ij(x(A) - 1, 1, entry (A, i + 1, 1)} (4.4) 
M(A) := \ij{r(A) - l,c(A) - 1, entry (A,i + 1, j + 1)) 

4.2. Interpreting LAp into V@L. Here we take the underlying field in the semantics of 
LAp to be Z2. We interpret the three-sorted theory LAp into the two-sorted theory V@L. 
The index sort is interpreted as the number sort, field elements are interpreted as Boolean 
values, and matrices are interpreted as strings. We translate each formula a of LAP into a 
formula a a of V@L. Here a u is in \Cf®l)i so by part(v) of Theorem l3.4l a° is equivalent 
to a T,f formula (a cr )' of V®L. The translation preserves provability (sequent theorems 
are translated to sequent theorems) and it also preserves truth in our intended standard 
models: (N, Z2, matrices over Z2) for LAp and (N, finite subsets of N) for V@L. 

In order for Theorems 1 1 . 2 1 and 1 1 . 41 (for the case of V®L) to follow from this interpretation 
we need that a term T of type matrix in LAp, when interpreted as a matrix over Z2, is 
translated to a string term T a in V®L which represents the same matrix, using conventions 
like those developed in Sections 12.21 and I3.ll In particular the matrix entries (elements of 
Z2) are certain bits in the string T CT , which is why we represent the elements 0, 1 of Z2 as 
the Boolean values _L and T. 

Recall that formulas of LAp are built from atomic formulas using the Boolean con- 
nectives A, V,— 1. Below we show how to translate atomic formulas. Then each formula a 
is translated by translating its atomic formulas and putting them together with the same 
connectives. Thus (a A f3) a = a a A (a V = a a V and (^a) CT = ^a a . Finally, a 
sequent of LAp is translated to a sequent of V®L formula by formula. Thus the translation 
of ai, . . . ,a k ->■ Pi, ■ ■ ■ ,Pe is of,. . . ,a% ->• /3f, . . . , >9f 

The atomic formulas of LAp are simple, since the only predicate symbols are < and the 
three sorts of =. Thus the main work in defining the interpretation comes in translating the 
three sorts of terms: m to m ff , t to t a , and T to T a . In order to do this we define function 
symbols in the language of V®L to interpret functions in LAp: — , /<ji v , /rem; /r, fc and 
functions and for certain formulas tp. All of these functions have definitions and 
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are definable in the subtheory V° of V@L. However to interpret £ and P we need to go 
beyond V°. 

Defining £ requires the parity function PAR(X), which is definable in the theory V°(2). 
We note that the theory LAp is the extension of the base theory LA obtained by adding the 
function P and the two axioms A35 and A36 defining matrix powering. Our interpretation 
translates LA into V (2). Conveniently V®L is obtained from V°{2) by adding the axiom 
defining PowSeq 2 (defining matrix powering). 

The next three subsections give inductive definitions for translating LAp terms of each 
of the three sorts. 



4.2.1. Index sort. The table below shows how to translate each index term m of LAp into a 
term m a in V@L, except we postpone to Subsection 14.2.31 translating terms involving r and 
c, which specify the numbers of rows and columns in a matrix. The graph of each function 
symbol used in rrf is defined by a £^ formula (except for the case condjndex^ m i n )i f° r 
which the formula is in £j^(a ')). These formulas are given in the table below. (The first 
three lines give the base case of the inductive definition of the translation.) 



LAp 

Oindex 
1 index 

i 

m + index n 
m * index n 
m—. , n 

index 

div(m, n) 




1 

i 

m" + n a 
m a ■ n a 
m a —n a 



fdiv(m° 



index variables map to number variables 

' — 'is standard limited subtraction, defined: 
x—y = zo(x = y + z)V(z = OAx<y) 



rem m, n 



cond ind ex(a,"i,n) 



/re 
fa 



^rrf 



rr 



/div is a number function with graph: 
/div(^) y) = z ^ (y ■ z < x Ax < y ■ (z + 1)) 
V(y = Az = 0) 

/rem is a number function with graph: 

frem{x, y) = Z -H- Z + y ■ f& v (x, y) = X 

where for certain formulas if, /« is defined: 

U(x, y) = z o (ip A x = z) V (-up Ay = z) 

Note that in LAp, condi n d ex ( a > m i n ) is defined only when all atomic sub-formulas of a have 
the form m = n or m < n. Thus f v (x, y) need only be defined when ip is the interpretation 
of such a formula from LAp. 



m , n 



4.2.2. Field sort. Since we represent elements of 7L<i by Boolean values in V(BL, each field 
term t is translated to a T,q(Cf(bl) formula t a . (This convenient representation of ring 
elements as truth values is not possible for the ring Z, complicating section 14.31 ) Each 
variable of type field is interpreted as a formula specifying the first entry of a 1 x 1 matrix 
variable. 

The table below shows how to interpret variables of type field (except those involving 
the matrix entry function e and the matrix sum function E, which are given in Subsection 

MM. 
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For t and u terms of type field, we interpret: 



LAP V®L 

Ofield -L 

Ifield T 

a X a (l, 1) X a is a string variable 

t + fie id u t a (Bu a that is, t a XOR u a 

t -field u t a ®u a 

t *g c id u t a An" 

t- 1 t a 



condfi c i d (a,t,n) (a a A t a ) V (^a CT A u a ) 

4.2.3. Matrix sort. The translation of terms involving matrices is complicated. Every ma- 
trix of LAp has three attributes: number of rows, number of columns, and matrix entries 
(field elements). In our two-sorted language for V®L we represent a matrix by a string 
which codes all of these. Thus an a x b matrix A is represented by a string A a such that 
A a (0, (a,b)) is true, and for all i,j with 1 < i < a and 1 < j < b and e(A,i,j) = A{j = 1, 
the bit A a (i,j) is true. All other bits of A a are false. 

We will use the formula isMatrix2(X) (equivalent to a Y<q formula) which asserts that 
the string X properly encodes a matrix as above. We allow the number of rows and/or 
columns to be 0, but any entry out of bounds is (a false bit). 

isMatrix 2 {X) = 3x,y<\X\ [X(0,(x,y))A (4.5) 

\fz<\X\ ((z = V ->X(z, 0)) A (z = (x, y) V -lX(0, z))) A 

[X(z) D ( Pair(z) A left(z) < x A (left(z) = V right(z) < y))]] 

The following table completes the inductive definition of the translation of all LAp 
terms. (Here PAR(X) is defined in (|3.1f) and the functions f r , f c , F v , F p are defined below).) 



LAp V®L 

A A matrix variables map to string variables 

r(T) / r (T CT ) 

c(T) f c (T°) 



e(T,m,n) T a (m a ,n CT ) A m a > A n a > A isMatrix 2 {!T' J ) 

J2(T) - PAR{T a ) A isMatrix 2 (T a ) 
Xij(m,n,t) F t v(m a ,n a ) 
p(m,T) F p (m a ,T a ) 
The above translation is designed so that for every LAp matrix term T (except a matrix 
variable A), isMatrix2(T a ) holds and is provable in (V©L). 

The V®L functions f r (X) and f c (X) extract the numbers of rows and columns of the 
matrix coded by X, and are used in the table above to translate the LAp terms r(T) and 
c(T). These have defining equations 

f r (X) = z o (^isMatrix 2 (X) A z = 0) V (isMatrix 2 (X) A 3y<\X\ X(0, (z,y))) 

f c (X) = z o (-^isMatrix 2 (X) A z = 0) V (isMatrix 2 (X) A 3y<\X\ X(0, {y, z))) 
Note that strings not encoding a matrix are semantically interpreted as the 0x0 matrix. 
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The translation of the matrix entry term e(T, m, n) is consistent with the LAp conven- 
tion that rows and columns are numbered starting with 1. A string not properly encoding 
a matrix has no non-zero entries. 

The translation of the entry sum term £^(T) is _L if T a does not properly encode a 
matrix, and otherwise it is the parity of 1 + the number of one-bits in T a (the extra bit is 
T°"(0, {a, b)) coding the number of rows and columns). 

The matrix term \ij {m, n, t) is interpreted by the V@L term Ff (m a , n a ) . Here Ft* (x, y) 
is a string function, which has additional arguments corresponding to any free variables in 
t a other than the distinguished variables i,j (we interpret i a = i and j a = j). The bit 
defining formula for Ff is 

Ft* (x, y)(b) O b = (0, (x, y)) V 3i<x 3j<y (i > Aj > A b = A t°(i, j)) 

where t a (i,j) indicates the distinguished variables i,j. Then isMatrix2(Ft<? (m a ,n <T )) is 
always true (and provable in V©L)). 

The last line in the table above translates the LAp term p(m,T) representing matrix 
powering to the V®L term F p (m a ,T CT ). The function F p is defined in terms of P0W2 
(Definition I2.8|) which defines matrix powering in V®L. But the definition of F p is not 
straightforward, because our translation of LAp terms includes the extra bit in row coding 
the matrix dimension. 

Thus we need two Eq 3 string functions. The first one strips the dimensions from the 
string, converting a matrix encoded according to our interpretation (of LAp into V®L) into 
a matrix according to the standard in V®L. 

Strip(X)(i,j)^X(i + l,j + l) 

The second function adds the dimension-encoding "wrapper" back, converting a standard- 
form V®L matrix into the form of an interpreted matrix from LAP. 

Wrap{r,c,X)(b) <* b = (0, (r,c» V 30<i,j<b {b = A X(i,j)) 

Given these two functions, we want F p (i,X) to interpret the LAp function p(i,X). Let 
r x = fr(X) be the interpretation of r(A), and cx = fc(X) be the interpretation of c(X). 
For strings X that satisfy isMatrix2(X) and powers i > 0, 

F p (i,X) = Wrap(rx,cx,Pow2(max.(r x ,cx),i,Strip(X))) 

This is consistent with the LAp convention that for i > 1, p(i, A) retains the row and column 
dimensions of A, and is defined by raising the matrix A 1 to the power i and truncating excess 
rows or columns, where A 1 is the square matrix of dimension max{r(^4), c(A)} obtained from 
A by adding entries where needed. 

There is a special case: if i = 0, then LAP specifies that the zeroth power of a matrix 
A is the v(A) x r(A) identity matrix. 

F p {0,X) = Wrap{r x ,r x , Pow 2 {r x ,0, Strip{X))) 

If —iisMatrix2(X) then F p (i,X) codes the 0x0 matrix. 

Combining all these cases, we can bit-define F p as follows: 

F p (i,X)(b) o 

[isMatrix 2 (X) A i = A Wrap(r x ,rx, Pow2(r x ,0, Strip (X)))] 
V i > A Wrap(rx,cx, Pow2(m.&x(rx,cx),i, Strip(X)))^ 
W[^isMatrix 2 (X) A b = (0, (0, 0})] 
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4.2.4. Translating atomic formulas. We translate atomic formulas as follows: 

LAP V®L 

m =i n dcx n m a = 1 n a 

m < index n m<J < n<T 
t =ficld U t a <-t U a 

T = ma t rix (r(T) = x(U)Y A (c(T) = C (U)f f\ 

Vi,j < (|T CT | + |C/ CT |) (e(i,j,T) =e(i,j,[/)r 
The translation of T = ma trix U could be simplified to T a = U a for all LAP terms T and U 
other than matrix variables A, since all other terms translate to terms satisfying isMatrix2- 
However two distinct string variables A, B could nevertheless represent matrices that are 
equal according to the Matrix Equality Rule (I4.2p if they do not satisfy isMatrix2 ■ 

4.2.5. Provability is preserved. 

Theorem 4.1. If LAP proves a\, . . . , /3\, . . . , then V®L proves 
a<[,...,a% (31,..., PI- 

Proof. (For more details sec [Fon].) Since our translation of formulas and sequents is trans- 
parent to the logical connectives, the logical axioms and rule applications for LAp (the 
propositional part of Gentzen's system LK) translate to logical axioms and rule applica- 
tions for V@L. 

The nonlogical axioms A1,...,A36 for LAp are given in Appendix[B]and the nonlogical 
rules are (I4.ip (Induction) and (I4.2p (Matrix Equality). It suffices to show that the trans- 
lated axioms are theorems of V(BL, and for each of the rules, if the translated hypotheses 
are theorems of V®L then so is the translated conclusion. 

The two rules are easily handled. Consider the Induction Rule (|4,ip . Since the trans- 
lation of each sequent of LAp is a sequent of quantifier-free formulas in V®L, the fact that 
this translated rule preserves theorems in V®L follows from the fact that V@L proves the 
induction scheme for T,q(Cf®l) formulas (part (iv) of Theorem I3.4f) . 

Now consider the Matrix Equality Rule (|4.2p . The fact that this translated rule pre- 
serves theorems in V(BL follows immediately from the way tlictt matrix equality — matrix is 
translated (Section ELX4"|) . 

Now consider the axioms of LAp. Axioms Al to A5 are equality axioms, and their 
translations mostly follow from the usual equality axioms. However = ma t r ix is not translated 
as equality, but it does translate to an equivalence relation, and the functions which take a 
matrix A as an argument (namely r, c, e, and p) depend only on the numbers of rows 
and columns of A, and the entries of A. Hence all translations of Al to A5 are theorems 
of V®L. 

Axioms A6 to A14 translate to simple properties of N under +, ■, and <, which are 
provable in V° (see Chapter III of [CN10J). A15, A16, and A17 translate to simple 
properties of N which are easily proved in V° from the definitions /di V , /rem, and f^. 

Axioms A18 to A26 are the axioms that define a field. Section 14.2.21 translates field 
terms to formulas (where _L represents and T represents 1). All these axioms translate 
into logical tautologies, so they are all trivially provable in V@L. For example 

A18 : — ^0/lAa + = a translates to 

A18 CT : (± O -.T) A ((X a (l, 1) ±) O X a (l, 1)) 
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Axiom A27 defines condfi e id> and its instances also translate into tautologies. 

Axioms A28 and A29 define the row, column, and entry functions r, c, e, and relate 
them to the lambda terms defining matrices. The translations of these follow easily from 
the definitions of f r , f c , F v , and are provable in V°. 

Axioms A30 to A34 define ^2(A) recursively by breaking the matrix A into four parts 
as illustrated in (|4,3p . Since the translation of XX A) is defined in terms of the parity 
function PAR, the following lemma is useful. It states that if two strings differ on exactly 
one bit, then they have opposite parities. 

Lemma 4.2. V°(2) proves: 
(X(k) o -Y(k))A 

(Vi <\X\ + \Y\i^k+* (X(i) <-> Y(i))) D (PAR(X) o -i PAR(Y)) 

Proof. To prove the lemma, recall (|3.ip that PAR(X) = Parity(\X\,X)(\X\). The proof 
in V°(2) proceeds by induction on the bits of the witness strings Parity (\X\,X) and 
Parity(\Y\,Y) computing the parities of X and Y. □ 

To prove the translations of axioms A28 and A29 in V°(2) we consider separately the 
two cases ^isMatrix2(A) and isMatrix2(A). The first case is trivial to prove because then 
A has zero rows and columns and it has no entries. So we may assume isMatrix2(A). In 
that case, according to Section T4.2.31 ^2(A) translates to —>PAR(A). 

Axiom A30 asserts ^2(A) = e(A, 1, 1) in case A has exactly one row and column. 
Since we may assume isMatrix2(A), the string A has at most two 1-bits: one is A(0, (1, 1)) 
specifying its dimension, and the other possibility is A(l, 1). With the help of Lemma l4.2( 
V°{2) proves A(l, 1) «-> ->PAR(A), as required. 

Axiom A31 states that if A has exactly one row and at least two columns, then Yl(^) = 
^2(B) + Ai c f£\, where B is the matrix obtained from A by deleting the last entry in row 
1. The string Y representing B differs in at most three bits from the string X representing 
A. Two of these are in row 0, since the dimensions of A and B are different, and the third 
possibility is X(l, c(A)) versus 5^(1, c(A)), where the latter is always 0. Hence V°(2) proves 
the translation of the axiom using three applications of Lemma 14.21 

Axiom A32 states that if A has only one column, then ^2(A) = ^2(A ). (See Section 
14.11 for the definition of A t , the transpose of A.) The translation is equivalent to 

f c (A) = 1 A isMatrix 2 (A) D (PAR(A) o PAR(B)) 

where B = F v (l, f r (A)) is the term translating A . The more general statement obtained 
by replacing the conclusion PAR(A) O PAR(B)) by asserting roughly that the parity of the 
first i rows of A equals the parity of the first i columns of B by induction on i, using Lemma 
14.21 The precise statement takes into account the bit in row of each matrix describing the 
dimension of the matrix. 

Axiom A33 asserts that if A has at least two rows and two columns, then Y2(A) is the 
the sum of the four submatrices pictured in fj4.3f) . The translation asserts that the parity of 
A is equivalent to the exclusive or of the parities of the four pieces. This is proved by double 
induction, first by considering just the first i rows of both sides, and for the induction step, 
to consider in addition the first j columns of row i + 1. 

Axiom A34 assert ^2(A) = in case A has rows or columns. Its translation is 
easily proved. 
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Axiom A35 is — > p(0, A) = I t /m. The term p(0, A) is translated F p (0, A), and the term 
I T (A) 1S translated F t <y (f r (A), f r (A)), where t is the term cond(i = j, 1, 0) (see the definition 
of 1^ in Section [4. ip . The symbol = in the axiom is really = m atrix> so the translated axiom 
asserts that the number of rows, number of columns, and entries of the translations of the 
two matrix terms, are all respectively equal. Proving this amounts to verifying that all the 
defined functions have their intended meanings. 

Axiom A36 is — > p(n + 1,A) = p(n, A) * A. The proof of the translation distinguishes 
the two cases ^isMatrix2(A) and isMatrix2(A). The former is easy, since both sides of the 
equation translate into the zero by zero matrix. V@L proves the second case by induction 
on n. Recall that p(n, A) is translated F p (n,A), where F p is defined in terms of P0W2, 
which in turn is defined in terms of PowSeq2 (13.3f) . whose value is a sequence of successive 
powers of its matrix argument. The induction proof involves verifying that the translated 
lambda term defining matrix product in Section 14.11 correctly corresponds to the definition 
(pHD defining Prod 2 in V®L. □ 



4.3. Interpreting LAp into V#L. Now we interpret the underlying 'field' in the seman- 
tics of LAp to be the ring Z of integers. Of course Z is not a field, so we cannot translate the 
axiom A21 a ^ -> a * (a" 1 ) = 1 for field inverses. However according to the footnote on 
page 283 of [SC04], this axiom is not used except in the proof of Lemma 3.1 and Theorem 
4.1. We do not need Lemma 3.1, and it is not hard to see that Theorem 4.1 does hold for in- 
tegral domains (the field inverse axiom can be replaced by the axiom a*b = 0,a ^ — > b = 
prohibiting zero divisors). It follows that LAP over integral domains proves the hard matrix 
identities in Definition 11.31 and the equivalence of (i), (ii), and (iii) in Section [TJ Thus our 
interpretation of LAp into V#L will show that Vj^L proves both of these results, and so 
Theorems 11.21 and 11.41 hold for V#L. 

The interpretation of LAp into V#L is similar to the interpretation into V®L. We first 
translate LAp into V#L, and then into V#L, using part (v) of Theorem 13.71 As before, 
it suffices to show how to translate terms and atomic formulas, since general formulas and 
sequents are translated by translating their atomic formulas. We do not translate terms or 
formulas involving the field inverse function t . 

We use t a and a a to denote the translation of LAp terms t and formulas a. 

Terms of type index (which do not involve matrices) are translated into number terms 
in exactly the same way as for V ®L (see Section 14.2. ip . 

Terms of type field (integers) are now translated into strings representing integers in 
binary, using the functions +z and Xz described in Section [3T2l 

For t and u terms of type field in LAp we interpret: 
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LAP V#L 

Ofieid the empty string 

lfieid the string "10", i.e., X such that X(i) f>i = l 

a X a field variables map to string variables 

t +ficld u t°~ +z u u 

t -field u t a +i {u a )' where (u a )' is identical to u a , except 

on the first bit 

t* Re i d u t a x z u a 

condfi e id(a, t, u) F^° nd (t a ,u a ) where for each formula tp, define: 

F^ ond (X,Y) = Z 
(<p f\ X = Z)V (-199 AY = Z) 



LAP terms T of type matrix are translated into string terms T a representing arrays 
of binary integers. As in the translation into V@L, if T has r rows and c columns, then 
T a (0,x) holds iff x = (r,c). For 1 < i < r and 1 < j < c, the binary integer entry is 
given by Row2(i, j,T a ), where R0W2 is defined in f|2.3|) . We need a formula isMatrix%(X) 
asserting that X is a valid encoding of a matrix of integers. This is defined similarly to 
isMatrixi for the translation into V®L. 

The following table shows how to translate terms involving matrices. 



LAP V#L 

A A matrix variables map to string variables 

r (T) /#cn 

c(T) f*(T°) 



e(T,m,n) F e (m a ,n a ,T a ) 
£(T) F E (T-) 

Xij(m,n,t) F*(m a ,n a ) 
p(m,T) F p # (m fJ ,T CT ) 

The functions above with superscripts # are similar to their prototypes for the translation 
into V®L. Thus ff and ft have the same definitions as f r and f c except isMatrix2 is 
replaced by isMatrixi- The entry function F e is defined by 

F e (i,j, X)(b) o X(i, (j, b)) A i > A j > A isMatrixz(X) 

The function Fj^(X) sums the integer entries of the matrix X, and is defined in terms 
of the iterated integer sum function Sumz(n,Y) mentioned in Section [3.21 (where Y is a 
rearrangement of the integer entries of the matrix X to form a linear list). 

The matrix term Xij(m,n,t} is interpreted by the V#L term FjZ(m cr ,n CT ). Here 
F^(x,y) has additional arguments corresponding to any free variables in t a other than 
the distinguished variables Its bit defining axiom is similar to that of Ff, used to 
translate lambda terms into V®L. 

F*(x,y)(b) O 

b = (0, (x,y)) V3i<x 3j<y 3k<b (i > A j > A b = (i, (j,k)) At r7 (i,j)(k)) 

The function Fp" interpreting matrix powering is defined similarly to F p for the V@L case, 
except that the functions Wrap and Strip need to be modified, P0W2 is replaced by Powz, 
and isMatrix2 is replaced by isMatrixi. 
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To complete the definition of the interpretation, we note that atomic formulas are 
translated as in Section 14.2.41 

4.3.1. Provability is preserved. 

Theorem 4.3. If LAp proves a±, . . . , a& — > /3i,...,/3g and none of the formulas ai or (3j 
contains a term of the form t~ x , then V#L proves af, . . . ,aZ — > /?f , • • • , P? , where now a 
refers to the translation of LAP into V#L. 

Proof. As in the case for V(&L, the proof involves showing that the translated axioms (with 
the axiom prohibiting zero divisors replacing the field inverse axiom A21) are theorems of 
V#L, and the translated rules preserve provability in Vj^L. (The proof for each axiom and 
rule is similar to the proof for the case of V(BL, except for the field axioms, which we discuss 
below.) This shows that the translation S a of a theorem S of LAp is provable in V#L, 
provided that the proof of S in LAp does not use the field inverse axiom A21. However 
Theorem 14.31 makes a stronger claim, namely that the proof of S can use the original axiom 
A21, provided that no term i _1 involving field inverses occurs in S. This holds because the 
theorems S are quantifier-free (semantically the free variables are universally quantified). 
Reasoning model-theoretically, if S holds for an arbitrary field, then it also holds for an 
arbitrary integral domain, because the domain can be extended to a field (the field of 
fractions) . 

The only LAp axioms whose translations in VjfL require proof methods significantly 
different than for V®L are the field axioms A18 to A26 (where we replace A21 a ^ —■ 
a*(a _1 ) = 1 by the integral domain axiom a*b = 0, a ^ — > b = 0). Showing that Vj^L (in 
fact VTC°) proves the commutative, associative, and distributive laws for the operations 
+z and Xz over the binary integers is tedious. This is not much different than proving that 
the operations +n and x ^ over binary natural numbers satisfy these laws. Some proof 
outlines and hints for the latter can be found in Section IX. 3. 6 of [ CNlOj . O 

5. Conclusion 

There are two general motivations for associating theories with complexity classes. The 
first is that of reverse mathematics [Sim99j : determining the complexity of concepts needed 
to prove various theorems, and in particular whether the correctness of an algorithm can 
be proved with concepts of complexity comparable to that of the algorithm. The second 
motivation comes from propositional proof complexity: determining the proof lengths of 
various tautology families in various proof systems. To explain these we start by stating 
the following: 

Open Problem 5.1. Can V®L prove the Cayley-Hamilton Theorem or the 'hard matrix 
identities' (Definition ll.3|) over TL^l Can VjfL prove these over Z? 

The importance of these questions stems partly from Theorem ll.2l which states that the 
theories prove the equivalence of the Cayley-Hamilton Theorem and two other properties 
of the determinant, and from Theorem 11.41 which states that the theories prove that the 
Cayley-Hamilton Theorem implies the 'hard matrix identities'. Theorems 11.21 and 11.41 follow 
from the corresponding theorems in LAp (for which Open Questions 15.11 also apply), and 
from our interpretations (Sections 14.2.51 and I4.3.ip . 
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It is possible that these questions (Open Questions 15.11) could be answered positively 
without answering the corresponding questions for LAP, by using methods not available 
to LAp. For example a proof in V®L might be able to take advantage of the simplicity 
of Z2, or a proof in V#L might be able use the algorithmic strength of integer matrix 
powering (as opposed to matrix powering over an unspecified field) to prove correctness 
of the dynamic programming algorithm for the determinant in |MV97j . This algorithm is 
based on a combinatorial characterization of det(A) using clow (closed walk) sequences in 
the edge-labeled graph specified by the matrix A. 

Over the field Z2 the hard matrix identities translate naturally to a family of prepo- 
sitional tautologies (and over Z they translate into another family of tautologies). The 
motivation for studying these identities is to give further examples of tautology families 
(like those in [BBP94| ) that might be hard for the class of propositional proof systems 
known as Frege systems. There is a close connection between the strength of a theory 
needed to prove these identities (or any T,q formula) and the strength of the propositional 
proof system required for their propositional translations to have polynomial size proofs. 
(Chapter 10 of [CNlOj gives propositional proof systems corresponding in this way to five 
of the theories in fll.4j> .) 

In particular, the fact that the hard matrix identities are provable in VP shows that 
their propositional translations have polynomial size proofs in Extended Frege systems. 
If the identities were provable in VNC 1 then the tautologies would have polynomial size 
Frege proofs. If the identities turn out to be provable in one of our new theories, then 
the tautologies would have polynomial size proofs in proof systems (yet to be defined) of 
strength intermediate between Frege and Extended Frege systems. 

Finally, we point out a lesser open problem. The main axiom for our new theory 
Vj^L asserts that integer matrix powers exist, where integers are represented in binary. As 
explained at the end of Section 12.31 integer matrix powering is complete for the complexity 
class DET even when restricted to 0-1 matrices, because the binary case is 74C°-reducible 
to the 0-1 case. It would be interesting to investigate whether the nontrivial reduction (see 
[Fon09| ) can be proved correct in the base theory VTC°, so that V#L could equivalently 
be axiomatized by the axiom for the 0-1 case rather than the binary case. 
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Appendix 



Appendix A. Details of Lemma 13.31 

Recall that in order to show that PowSeq 2 is £^ (/^-definable in V(BL, we need to show 
both: 

PowSeq* 2 (b, W 1 ,W 2 ,X)=Y++ 5 PowSeq% (6, Wi,W 2 ,X, Y) 

and 

V@L h V6VX, W h W 2 3\Y6 PowSeqt (6, W X ,W 2 ,X, Y) 
We introduce the AC functions max and S in order to simplify the definition of SpowSeq^ 

over V°(2), and then use proposition 12. 131 to obtain a provably equivalent Y,f (C\) formula. 

Since V°(2) is a conservative extension of V°(2), this suffices to show that PowSeq 2 is 

Sf (£^)-definable in V@L. In order to do so, we must establish a Sf {C-fac°(2)) formula 

equivalent to the desired formula 6p ow s eq *- 

Let the function max(n,W) yield the maximum number from a list of n numbers 

encoded in string W: 

max(n, W) = x o 3i < nVj < n, x = (W) % > (Wy 

The string function S can be bit-defined as follows. Consider two strings W\, repre- 
senting a list of b numbers, and X, representing a list of b matrices as above. The function 
S(b, Wi,X) returns the matrix with matrices (appropriately padded with zeroes) on the 
diagonal, and all other entries zero. Let m = max(b, W\). Let X[ be the m x m matrix Xj 
padded with columns and rows of zeroes: 



m 



tu. < 



Xi 







Then S(b, Y\ , X) is the string encoding the matrix: 



x' 2 











x> 



6-1 



All entries not in the matrices along the diagonal are 0. 
The string function S can be bit-defined: 

S(b,Wi,X)(i,j) <-> 3a < b, 3i' , f < (Wi) a , i<mAj<mA 

i = i' + mAj = j' + mA X [a] (i',f) 

By convention, the unspecified bits (i.e., bits b that are not pair numbers) are all zero. This 
bit-definition ensures that the string S(b,Wi,X)) is uniquely defined. 

We will use this matrix S(b,Wi, X) and the existence and uniqueness of its sequence 
of matrix powers to show the existence and uniqueness of the aggregate matrix powering 
function. 
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Let n max denote max(b, W\) and k max denote max(b, W2). Recall equation (13, 3D defining 
8powSeq 2 {n', k, X). By convention, the aggregate function of PowSeq 2 is denned as: 

PowSeq* 2 (b,W 1 ,W 2 ,X) = 7h|F|< (b, (k mSLX , (n max , n max })} A 

Vj < |y|,Vi < b, [(Y(j) d Poir(j)) A W?^^)*, (Wa)*, JfM.yW)] (A.l) 

The right-hand side of (jA.ip is the relation 5p ow s e q*(b, W±, W 2 ,X, Y); it has a provably 
equivalent £f (£^)-formula 3Z < t,ap 0W s e q*(b,Wi,W 2 , X,Y, Z), used as the definition for 
PowSeq 2 over F©L. 

It remains to prove the existence and uniqueness for PowSeq 2 (b, W\, W2,X). The func- 
tions max and S allow for a straightforward proof based on the existence and uniqueness 
of the string A = PowSeq 2 (b • n max , /c max , S(b, W\, X)), where (b,Wi) and 

k ma , x = max(b,W 2 ). 

We would like to define the string B = PowSeq 2 (b,Wi,W 2 , X) from A. Notice that B 
encodes a list of strings, each of which represents a power of the matrix S(b,Wi,X). The 
string A encodes nearly the same information, but in a different format: A is a list of lists, 
each of which encodes the powers of a matrix from the list X of matrices. 

Observe that: 



s(b,Wi,xy 



x' 2 







X'i 







x, 



b-l 



Also, 



xf 



m 



Tin < 



X) 




... 

Thus we can "look up" the required powers of each matrix. Let A and B be shorthand: 

A = PowSeq 2 (b-n Uiax ,k max ,S(b,W 1 ,X)) 
B = PowSeq^b, Wi,W 2 ,X) 
Then we can define PowSeq 2 from PowSeq 2 as follows. 

sMW(ij) o m<bAp<(W 2 ) m Ai<(lfi) m Aj KiWi)™ A 

3m' < m, m' + 1 = m A vl^ (n max ■ m' + i, n max • m' + i) (A. 2) 

Here, m represents the number of the matrix in the list X, p represents the power of matrix 
X m , and i and j represent the row and column; thus the formula above defines the bit 
(Xm)(i, j) for all matrices X m in the list X. By shifting around the pieces of this formula 
and adding quantifiers for m, p, i, and j, it is clear that (1A.2I) can be translated into the 
appropriate form for a S^-definition of the graph of PowSeq 2 . 

Thus existence and uniqueness of PowSeq 2 (b,Wi,W 2 , X) follow from existence and 
uniqueness of PowSeq 2 (b ■ n max , fc max , S(b, W\, X). Since max and S are AC functions, 
they can be used without increasing the complexity of the definition (again, by use of 
proposition 12.131 as above). 
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Appendix B. Axioms for LAp 

Here we use the abbreviations given at the end of Section 14.11 such as A + B, A 1 etc. We 
use Aij for e(A,i,j). 
Equality Axioms 

These are the usual equality axioms, generalized to apply to the three-sorted theory 
LA. Here = can be any of the three equality symbols, x, y, z are variables of any of the three 
sorts (as long as the formulas are syntactically correct). In A4, the symbol / can be any 
of the nonconstant function symbols of LAp. However A5 applies only to <, since this in 
the only predicate symbol of LAp other than =. 
Al — > x = x 
A2 x = y — > y = x 
A3 (x = y Ay = z) — > x = z 
A4 xi = yi, ...,x n = y n -> fx\...x n = fy\...y n 
A5 h = jt,i 2 = j2,k < i2 -> ji < 32 

Axioms for indices 

A6 -> i + 1 ji 

A7 —> i * (j + 1) = (i * j) + i 

A8 i + 1 = j + 1 -» i = j 

A9 —> i < i + j 

A10 ^i + = i 

All —>■ % < j,j <i 

A12 -> i + ( j + 1) = (t + j) + 1 

A13 i<j,j <i^i = j 

A14 -> i * = 

A15 i<j,i + k = j^j—i = k and i ^ j j—i = 

A16 i 7^ — )• rem(i, j) < j and j 7^ — >• i = j * div(i, j) + rem(z, j) 

A17 a — > cond(a, i, j) = i and ->a — >• cond(oi, i, j) = j 

Axioms for field elements 

A18 ^0/lAa + = a 
A19 -> a+ (-a) = 
A20 -» 1 * a = a 
A2ll]a/0^a*(a- 1 ) = 1 
A22 -> a + b = b + a 
A23 — s>a*6 = 6*a 
A24 -> a + (6 + c) = (a + 6) + c 
A25 — > a * (b * c) = (a * b) * c 
A26 -^a*(b + c) = a*b + a*c 

A27 a — > cond(a, a,b) = a and ->a — >■ cond(a, a,b) = b 

Axioms for matrices 



A21 can be replaced by a* b = 0, a 7^ — 5*6 = for the purpose of proving Theorems 11.21 and ll.4l 
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Axiom A28 states that e(A,i,j) is zero when i,j are outside the size of A. Axiom A29 
defines the behavior of constructed matrices. Axioms A30-A33 define the function E 
recursively by first defining it for row vectors, then column vectors (recall A 1 is the transpose 
of A), and then in general using the decomposition (|4.4p . Finally, axiom A34 takes care of 
empty matrices. 

A28 (i = Vr(A) <iVj = V c(A) < j) -> e(A,i,j) = 
A29 — > r(\ij{m,n,t}) = m and — > c(Xij(m,n,t)) = n and 
1 < i, i < m, 1 < j, j < n — > e(\ij{m, n, t),i,j) = t 
A30 r(A) = 1, c(A) = 1 -► E(A) = e(A, 1, 1) 

A31 r(A) = 1,1< c(A) -> E(A) = E(Aij(l,c(A) - 1, Ay)) + A lc(A) 
A32 c(A) = 1 -> E(A) = E(A*) 

A33 1 < r(A), 1 < c(A) -)■ E(A) = e(A, 1, 1) + E(i?(A)) + E(S(A)) + E(M(A)) 
A34 r(A) = V c(A) = -> EA = 

Axioms for matrix powering 

A350 -> p(0, A) = J r(A) 

A36 -> p(n + 1, A) = p(n, A) * A. 

2 This version is from page 45 of |SK01] 
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